DETAILED ACTION

1. This Office Action is in response to correspondence filed October 20, 2008 in
reference to application 09/459,380. Claims 1-12, 14-25, and 27-54 are pending and 
have been examined. 

Continued Examination Under 37 CFR 1.114 

2. A request for continued examination under 37 CFR 1.114, including the fee set 
forth in 37 CFR 1.17(e), was filed in this application after final rejection. Since this
application is eligible for continued examination under 37 CFR 1.114, and the fee set
forth in 37 CFR 1.17(e) has been timely paid, the finality of the previous Office action
has been withdrawn pursuant to 37 CFR 1.114. Applicant's submission filed on October 
20, 2008 has been entered. 

Response to Amendment 

3. The amendments filed October 20, 2008 have been accepted and considered in 
this office action. Claims 1, 4, 10, 17, 20, 23- 47, and 48 have been amended, and
claims 13 and 26 have been cancelled. 

Response to Arguments 

4. Applicant's arguments with respect to claims 1, 4-5, 8-10, 22-23, 28-29, 47, and 48
have been considered but are moot in view of the new ground(s) of rejection.

5. Applicant's arguments filed October 20, 2008 with respect to claims 30, 37, 41,
44, 49, and 50 have been fully considered but they are not persuasive.

6. With regard to applicant's arguments, see Remarks pages 31-32, that neither
McDonough nor Furui teaches the limitations of "analyzing a voice message to
determine if the voice message exhibits a "predetermined pattern of speech," where the
predetermined pattern of speech represents "at least one of a tone of speech in the
voice message and a frequency of the speech in the voice message"", the examiner
respectfully disagrees. While applicant correctly asserts on page 32 that HMMs
generally operate on a phoneme-based level and do not detect broader emotional
patterns such as urgency, it is noted that all that is required in the claims is determining
a pattern in speech from tone or frequency. It is noted that although the claims are
interpreted in light of the specification, limitations from the specification are not read into
the claims. See In re Van Geuns, 988 F.2d 1181, 26 USPQ2d 1057 (Fed. Cir. 1993).
As claims must be given the broadest reasonable interpretation during examination, one
can fairly read the claims to cover patterns on a phoneme-by-phoneme level as
described in the working of the HMM models taught by Furui. While the specification does
suggest using patterns that encompass more than a phoneme, this is not reflected in
the language of the claims.

Claim Rejections - 35 USC §112 

7. The following is a quotation of the first paragraph of 35 U.S.C. 112: 

The specification shall contain a written description of the invention, and of the manner and process of
making and using it, in such full, clear, concise, and exact terms as to enable any person skilled in the
art to which it pertains, or with which it is most nearly connected, to make and use the same and shall 
set forth the best mode contemplated by the inventor of carrying out his invention. 

8. Claims 4 and 20 are rejected under 35 U.S.C. 112, first paragraph, as failing to
comply with the written description requirement. The claim(s) contains subject matter 
which was not described in the specification in such a way as to reasonably convey to 
one skilled in the relevant art that the inventor(s), at the time the application was filed, 
had possession of the claimed invention. Claims 4 and 20 have been amended to
require that receiving the user specified word or phrase is performed after receiving the voice
message. However, upon examination, no support could be found in the specification
or claims as originally filed to support this amendment.

Claim Rejections - 35 USC § 103 

9. The text of those sections of Title 35, U.S. Code not included in this action can 
be found in a prior Office action. 

McDonough and Epstein

10. Claims 1, 4-10, 14-17, 20-23, 26-29, 47, 48, and 51-52 are rejected under 35
U.S.C. 103(a) as being unpatentable over McDonough et al. [US Patent 5,625,748] in
view of Epstein et al. [US Patent 6,327,343], both already of record.

11. Regarding claim 1, McDonough describes the embodiment for processing
untranscribed speech by describing the content and functionality of the recited
limitations recognizable as a whole to one versed in the art as the following terminology:

voice representations and voice messages [at column 6, lines 23-29, as 
untranscribed speech data]; 

storing voice, corresponding to a word or phrase [at column 2, lines 1-17, as 
training words to the vocabulary, and at column 5, lines 47-48, as a vocabulary of words 
and phrases for speech events]; 

each voice representation is associated with a value [at column 6, lines 41-42, as 
parameter values for individual event distributions]; 

storing actions [at column 2, lines 14-17, as create a new node associating an 
action with a word]; 

receive a voice message [at column 1, lines 53-54, as provide an input speech 
message]; 

selecting a user specified word or a user specified phrase by a user, the selected 
user specified word or phrase corresponding to a word or phrase having a 
corresponding stored voice representation (column 12 line 13 events are selected by 
human operator from list of possible events, see figure 5 as well.) 

analyze the voice message to determine if one or more stored voice
representations corresponding to the selected user word or phrase occur in the
message [at column 5, lines 43-50, as process a spoken message to produce a signal
for the potential speech events in the spoken data. If user selection is used as described
column 12 line 13, then it is inherent that the models will correspond to the selection];

generate a final criteria measurement value associated with the voice message 
[at column 7, lines 28-44, as summing confidence scores over the speech data]; 

the final criteria measurement value based on the value associated with each
determined stored voice representation occurring in the voice message [column 6 lines
4-42, models are trained, and parametric probabilistic models and parameter values are
developed for stored representations. Column 6 line 1, the topic classifier uses model
parameters determined in training. Therefore it is inherent that the confidence scores
will be determined in part by these probabilistic parameters.];

perform one (or more) action(s) if the stored voice representations are found in 
the voice message [at column 2, lines 1-8, as route the message according to the action 
associated with the word]; 

performing the (stored) action based on the final criteria measurement value [at 
column 12, lines 28-41, as sort, classify or route based on the topic, wherein at column 
5, line 64-column 6, line 1 the topic choice is a confidence score that a topic is present]. 

McDonough does not specifically teach that the selected word is received from
the user as opposed to being selected only.

In the same field of topic determination, Epstein teaches that a user can input 
key words into the device as a part of programming (column 12 lines 18-37.) 

Therefore it would have been obvious to one of ordinary skill in the art at the time 
of the invention to combine the input means of Epstein as a way of performing the 
selection in McDonough in order to allow the user to select the words without having to
use cumbersome lists or menus.

12. Claim 4 is set forth including the limitations of claim 1. McDonough and Epstein
describe those limitations as indicated there. McDonough also describes additional
limitations as follows: 

after receiving the voice message, receiving the user-specified word or phrase 
from the user (column 12 lines 27-42, topic analysis can be carried out on recordings, 
which could obviously be stored in the system before phrases are selected by the 
user.); and 

after receiving the user-specified word or phrase from the user, performing the 
step of analyzing (this order is inherent for the device to operate. The model parameters 
must be determined before analyzing can take place). 

13. Claim 5 is rejected using the same rationale as in the previous Office action that 
was mailed November 20, 2002 as paper 3, and is reproduced here: 

Claim 5 is set forth including the limitations of claim 1. McDonough and Epstein
describe those limitations as indicated there. McDonough also describes additional
limitations as follows: 

the user specifying actions to be performed if the voice representation is found in 
the voice message [at column 2, lines 1-24, as the user specifies the correctness of the 
action associated with the word to route the message according to the action associated
with the word];

storing the user specified actions [at column 2, lines 1-24, as the user specifies
the correctness of the action to create a new node associating an action with a word]; 

the user specified actions are included in performing the stored actions [at 
column 2, lines 1-24, as route the message according to the action associated with the 
word for which the user specifies the correctness of the action associated with the 
word]. 

14. Claim 6 is rejected using the same rationale as in the previous Office action 
(mailed November 20, 2002 as paper 3), and reproduced here: 

Claim 6 is set forth including the limitations of claim 1. McDonough describes and
makes obvious those limitations as indicated there. McDonough [at column 12, lines 40-
41] also describes classifying stored voice messages.

McDonough, however, does not explicitly describe classifying the message as
urgent.

Epstein [at column 8, lines 23-34] also describes processing a voice message as 
the embodiment for stored audio data. Epstein describes: 

marking the message as urgent [at column 17, line 40, as adding an urgency 
stamp]. 

Although McDonough describes classifying messages, McDonough does not
enumerate any particular classifications. In view of Epstein's labeling a message as
urgent, it would have been obvious to one of ordinary skill in the art of message
handling at the time of invention to include Epstein's concept of marking as urgent as a
classification for McDonough's messages because that would have enabled signaling 
the addressee that an urgent message is available. 

15. Claim 7 is rejected using the same rationale as in the previous Office action 
(mailed November 20, 2002 as paper 3), and reproduced here: 

Claim 7 is set forth including the limitations of claim 1. McDonough describes and
makes obvious those limitations as indicated there. McDonough [at column 12, lines 36-
41] also describes routing a phone call based on the message. 

McDonough, however, does not explicitly describe calling a pager.

Epstein [at column 8, lines 23-34] also describes processing a voice message as
the embodiment for stored audio data. Epstein describes:

calling a pager [at column 4, lines 1-3, as transmit a message to the user's
pager].

Although McDonough describes routing calls and messages, McDonough does 
not enumerate any particular terminal type for receiving the message. In view of 
Epstein's transmission to a pager, it would have been obvious to one of ordinary skill in 
the art of message handling at the time of invention to include Epstein's ability to call a 
pager for McDonough's messages because that would have enabled signaling the 
addressee when the user is not at home or is out of the office, as Epstein describes [at 
column 14, lines 47-48]. 

16. Claim 8 is rejected using the same rationale as in the previous Office action that
was mailed November 20, 2002 as paper 3, and is reproduced here: 

Claim 8 is set forth including the limitations of claim 1. McDonough and Epstein
describe and make obvious those limitations as indicated there. Because
McDonough's embodiments are directed equally to either processing of phone calls or 
processing of stored messages, McDonough describes: 

forwarding the voice message [at column 12, lines 36-41, as routing a phone call 
based on the message, where the message is forwarded in the embodiment processing 
a stored message]. 

17. Claim 9 is rejected using the same rationale as in the previous Office action that 
was mailed November 20, 2002 as paper 3, and is reproduced here: 

Claim 9 is set forth including the limitations of claim 1. McDonough and Epstein
describe and make obvious those limitations as indicated there. Because
McDonough's embodiments are directed equally to either processing of phone calls or 
processing of stored messages, McDonough describes: 

the voice message is received over a telephone line [at column 2, line 19, as 
speech over the telephone]. 

18. Regarding claim 10, McDonough describes the claimed limitations as a whole
recognizable to one versed in the art as the embodiment for processing untranscribed
speech by describing the content and functionality of the recited limitations recognizable
as a whole to one versed in the art as the following terminology:

voice representations and voice information from a person [at column 6, lines 23- 
29, as untranscribed speech data, where at column 2, lines 25-26, the user speaks 
naturally]; 

storing voice, corresponding to a word or phrase [at column 2, lines 1-17, as 
training words to the vocabulary, and at column 5, lines 47-48, as a vocabulary of words 
and phrases for speech events]; 

each voice representation is associated with a value [at column 6, lines 41-42, as 
parameter values for individual event distributions]; 

storing actions [at column 2, lines 14-17, as create a new node associating an
action with a word];

receive voice information from a person over a communications line
[at column 2, lines 18-19, as conversational speech over the telephone];

selecting a user specified word or a user specified phrase by a user, the selected 
user specified word or phrase corresponding to a word or phrase having a 
corresponding stored voice representation (column 12 line 13 events are selected by 
human operator from list of possible events, see figure 5 as well.) 

analyze the voice message to determine if one or more stored voice
representations corresponding to the selected user word or phrase occur in the
message [at column 5, lines 43-50, as process a spoken message to produce a signal
for the potential speech events in the spoken data. If user selection is used as described
column 12 line 13, then it is inherent that the models will correspond to the selection];

generate a final criteria measurement value associated with the voice information
[at column 7, lines 28-44, as summing confidence scores over the speech data]; 

the final criteria measurement value based on the value associated with each
determined stored voice representation occurring in the voice message [column 6 lines
4-42, models are trained, and parametric probabilistic models and parameter values are
developed for stored representations. Column 6 line 1, the topic classifier uses model
parameters determined in training. Therefore it is inherent that the confidence scores
will be determined in part by these probabilistic parameters.];

perform actions if the voice information includes a stored voice representation [at 
column 12, lines 28-41, as respond to, route, or classify the phone call or incoming 
voice message using the sorting for detection of speech data of interest]; 

performing the stored action based on the final criteria measurement value [at 
column 12, lines 28-41, as sort, classify or route based on the topic, wherein at column 
5, line 64-column 6, line 1 the topic choice is a confidence score that a topic is present]. 

McDonough does not specifically teach that the selected word is received from
the user as opposed to being selected only.

In the same field of topic determination, Epstein teaches that a user can input 
key words into the device as a part of programming (column 12 lines 18-37.) 

Therefore it would have been obvious to one of ordinary skill in the art at the time 
of the invention to combine the input means of Epstein as a way of performing the 
selection in McDonough in order to allow the user to select the words without having to
use cumbersome lists or menus.

19. Claim 14 is set forth including the limitations of claim 10 and with additional
limitations similar to limitations set forth in claim 5. McDonough and Epstein describe
the limitations as indicated there.

20. Claim 15 is set forth including the limitations of claim 10. McDonough and
Epstein describe those limitations as indicated there. McDonough also describes
additional limitations as follows: 

receiving voice information during a call [at column 12, lines 37-38, as spoken 
message by a phone call from a caller]; 

compiling statistics on the call [at column 7, lines 46-47, as compute the scoring 
statistic given the data in the message]. 

21. Claim 16 is set forth including the limitations of claim 10 and with additional
limitations already described there. 

22. Regarding claim 17, McDonough describes the claimed limitations as a whole 
recognizable to one versed in the art as the embodiment for processing untranscribed 
speech by describing the content and functionality of the recited limitations recognizable 
as a whole to one versed in the art as the following terminology: 

voice representations and voice messages [at column 6, lines 23-29, as 
untranscribed speech data]; 

storing voice, corresponding to a word or phrase [at column 2, lines 1-17, as
training words to the vocabulary, and at column 5, lines 47-48, as a vocabulary of words 
and phrases for speech events]; 

storing actions [at column 2, lines 14-17, as create a new node associating an 
action with a word]; 

receive a voice message [at column 1, lines 53-54, as provide an input speech 
message]; 

selecting a user specified word or a user specified phrase by a user, the selected 
user specified word or phrase corresponding to a word or phrase having a 
corresponding stored voice representation (column 12 line 13 events are selected by 
human operator from list of possible events, see figure 5 as well.) 

analyze the voice message to determine if one or more stored voice
representations corresponding to the selected user word or phrase occur in the
message [at column 5, lines 43-50, as process a spoken message to produce a signal
for the potential speech events in the spoken data. If user selection is used as described
column 12 line 13, then it is inherent that the models will correspond to the selection];

generate a final criteria measurement value associated with the voice message 
[at column 7, lines 28-44, as summing confidence scores over the speech data]; 

the final criteria measurement value based on the value associated with each
determined stored voice representation occurring in the voice message [column 6 lines
4-42, models are trained, and parametric probabilistic models and parameter values are
developed for stored representations. Column 6 line 1, the topic classifier uses model
parameters determined in training. Therefore it is inherent that the confidence scores
will be determined in part by these probabilistic parameters.];

each voice representation is associated with a final criteria measurement value 
[at column 7, lines 28-44, as putative words and phrases with confidence scores are 
summed over the speech data]; 

perform one (or more) action(s) if the stored voice representations are found in 
the voice message [at column 2, lines 1-8, as route the message according to the action 
associated with the word];

performing the (stored) action based on the final criteria measurement value [at 
column 12, lines 28-41, as sort, classify or route based on the topic, wherein at column 
5, line 64-column 6, line 1 the topic choice is a confidence score that a topic is present]; 

a storage device for storing the parameters associated with the claimed 
functionality [at column 12, line 2, as the internal structure of the event detector, for the 
example at column 2, lines 1-9, the word nodes and action nodes]; 

a processor for accomplishing the claimed functionality [at column 5, lines 45-46, 
as a speech event frequency detector]. 

McDonough does not specifically teach that the selected word is received from
the user as opposed to being selected only.

In the same field of topic determination, Epstein teaches that a user can input 
key words into the device as a part of programming (column 12 lines 18-37.) 

Therefore it would have been obvious to one of ordinary skill in the art at the time 
of the invention to combine the input means of Epstein as a way of performing the 
selection in McDonough in order to allow the user to select the words without having to
use cumbersome lists or menus.

23. Claim 20 is set forth including the limitations of claim 17. McDonough and
Epstein describe the limitations as indicated there. McDonough also describes
additional limitations as follows: 

after receiving the voice message, receiving the user-specified word or phrase 
from the user (column 12 lines 27-42, topic analysis can be carried out on recordings, 
which could obviously be stored in the system before phrases are selected by the 
user.); and 

after receiving the user-specified word or phrase from the user, performing the 
step of analyzing (this order is inherent for the device to operate. The model parameters 
must be determined before analyzing can take place). 

24. Claim 21 is rejected using the same rationale as in the previous Office action 
(mailed November 20, 2002 as paper 3), and reproduced here: 

Claim 21 is set forth including the limitations of claim 17 and with additional 
limitations similar to limitations set forth in claim 5. McDonough describes the limitations 
as indicated there. McDonough [at column 2, lines 17-28] receives input from the user 
for establishing user selection of words and actions. 

McDonough, however, does not explicitly describe an interface between the user
and the speech event frequency detector.

Epstein [at column 8, lines 23-34] also describes processing a voice message as
the embodiment for stored audio data. Epstein also describes: 

a user interface [at column 6, lines 7-13, as a programming interface]. 

Although McDonough describes receiving input from the user, McDonough does 
not explicitly describe any means to accept this input. Because McDonough describes 
user input, it would have been obvious to one of ordinary skill in the art of processing 
devices at the time of invention to include Epstein's concept of a programming interface 
with McDonough because that would provide the means for the user to provide the input 
to train McDonough's neural network to the words and actions. 

25. Claim 22 is set forth including the limitations of claim 17 and with additional 
limitations similar to limitations set forth in claim 9. McDonough and Epstein describe 
the limitations as indicated there. 

26. Claims 23, 28, and 29 are set forth with limitations similar to claims 10, 15, and 9.
McDonough and Epstein describe the limitations as indicated there. McDonough also
describes additional limitations as follows: 

a storage device for storing the parameters associated with the claimed 
functionality [at column 12, line 2, as the internal structure of the event detector, for the 
example at column 2, lines 1-9, the word nodes and action nodes]; 

a processor for accomplishing the claimed functionality [at column 5, lines 45-46, 
as a speech event frequency detector]. 

27. Claim 27 is rejected using the same rationale as in the previous Office action
(mailed November 20, 2002 as paper 3), and reproduced here: 

Claim 27 is set forth including the limitations of claim 23 and with additional 
limitations similar to limitations set forth in claims 14 and 21. McDonough and Epstein
describe and make obvious the limitations as indicated there. 

28. Claim 47 is set forth with limitations similar to limitations set forth in claim 1.
McDonough and Epstein describe the limitations as indicated there. McDonough also
describes additional limitations as follows: 

means for storing the parameters associated with the claimed functionality [see 
Fig. 1, items 20, 22, and their descriptions especially at column 12, line 2, of the internal 
structure of the event detector, for the example at column 2, lines 1-9, the word nodes
and action nodes]; 

means for receiving and analyzing a voice message and accomplishing the 
claimed functionality [see Fig. 1, items 10, 12, 16, 18, and their descriptions, especially 
at column 5, lines 45-46, of a speech event frequency detector, topic classifier and 
classifier output]. 

29. Claim 48 is set forth with limitations similar to limitations set forth in claim 23. 
McDonough and Epstein describe the limitations as indicated there, where the storage
device and the processor are the means for storing, means for receiving, and means for
analyzing.



30. Claim 51 is set forth with limitations similar to limitations that are also set forth in
claim 1. McDonough and Epstein describe the limitations as indicated there.

McDonough [at column 5, lines 45-46] also describes a processor for 
accomplishing the claimed functionality. 

McDonough, however, does not explicitly describe that the speech event
frequency detector is computer-implemented and with computer-readable contents. 

Epstein [at column 8, lines 23-34] also describes processing a voice message as 
the embodiment for stored audio data. Epstein describes: 

a computer readable medium whose contents cause the computer to perform the 
procedure [at column 4, lines 4-30, as associated memory for software implemented on 
a computer to accomplish the functionality]. 

To the extent that McDonough's system does not necessarily contain typical 
computer hardware and software, it would have been obvious to one of ordinary skill in 
the art of implementing functional descriptions of operations at the time of invention to 
include Epstein's concept of computer implementations by software loaded in computer- 
readable memory to achieve McDonough's speech processing functionality because 
that would have provided the best implementation under particular circumstances 
identified and evaluated by a skilled artisan. For example, it is within the ordinary skill of 
an artisan to determine that software elements, such as Epstein's concept, benefit
changing processing functions or adding other processing functions because software
elements are more easily modified than hardware elements.

31. Claim 52 is rejected using the same rationale as in the previous Office action
(mailed November 20, 2002 as paper 3), and reproduced here: 

32. Claim 52 is set forth with limitations similar to limitations set forth in claim 23 and 
with additional limitations similar to limitations set forth in claim 51. McDonough and
Epstein describe and make obvious the limitations as indicated there. 

McDonough, Epstein, and Furui

33. Claims 2, 3, 11, 12, 18, 19, 24, 25, 31, 33-34, 38, 42, 45, and 53-54 are rejected
under 35 U.S.C. 103(a) as being unpatentable over McDonough et al. [US Patent
5,625,748] and Epstein [US Patent 6,327,343] in view of Sadaoki Furui, "Digital Speech
Processing, Synthesis, and Recognition," Marcel Dekker, Inc., New York, 1989, pp. 
225-289, both already of record. 

34. Claim 2 is rejected using the same rationale as in the previous Office action 
(mailed November 20, 2002 as paper 3), and reproduced here: 

Claim 2 is set forth including the limitations of claim 1. McDonough and Epstein
describe those limitations as indicated there. McDonough [at column 7, lines 26-48]
also describes phonetic wordspotting for the preferred embodiments. 

Furui describes: 

a voice message [at page 226, lines 19-22, as speech waveforms];

each stored voice representation is a phoneme representation of a word or
phrase [at page 244, lines 1-4, as reference templates use concatenated phonemes to
represent words].

Although McDonough describes phonetic wordspotting, McDonough does not
explicitly describe phoneme models. 

To the extent that McDonough's stored voice representations of words are not 
necessarily phoneme representations, it would have been obvious to one of ordinary 
skill in the art of speech recognition at the time of invention to include Furui's phoneme 
based lexicon for wordspotting as McDonough's trained vocabulary, because 
McDonough points out phonetic wordspotting as preferred. 

35. Claim 3 is rejected using the same rationale as in the previous Office action 
(mailed November 20, 2002 as paper 3), and reproduced here: 

Claim 3 is set forth including the limitations of claims 1-2. McDonough, Epstein,
and Furui describe and make obvious those limitations as indicated there. McDonough
[at column 11, lines 9-11] also describes implementing algorithms in the C
programming language for computing.

McDonough and Furui, however, do not explicitly describe digital conversion of
analog signals. Epstein [at column 8, lines 23-34] also describes processing a voice message
as the embodiment for stored audio data. Epstein describes:

a voice message [at column 8, lines 33-35, as stored audio data]; 

converting the analog voice message from analog to digital [at column 7, lines 1-5,
as convert the analog data, such as an analog recorder, into digital data]; and

processing the digitized voice message [at column 9, lines 40-67, as convert 
voice data]. 

To the extent that McDonough's data is not innately digitized for the suggested 
computer algorithms, it would have been obvious to one of ordinary skill in the art of 
speech processing at the time of invention to include Epstein's analog to digital 
conversion for McDonough's data or Furui's data because the digital data could be 
processed on general purpose digital computers or programmable digital signal 
processors. 

For the digital data then, Furui describes: 

processing the voice message into phonemes [at page 244, lines 8-28, as short 
periods of input speech with phoneme-template structure are compared to phoneme 
reference templates to represent each word by concatenation of phonemes]; and 

comparing the phonemes from the voice message with stored voice 
representations [at page 244, lines 42-44, as match the same phoneme positions 
between the input speech and reference templates]. 

36. Claim 11 is rejected using the same rationale as in the previous Office action
(mailed November 20, 2002 as paper 3), and reproduced here:

Claim 11 is set forth including the limitations of claim 10 and with additional
limitations similar to limitations set forth in claim 2. McDonough and Epstein and Furui
describe and make obvious the limitations as indicated there.

37. Claim 12 is rejected using the same rationale as in the previous Office action 
(mailed November 20, 2002 as paper 3), and reproduced here: 

Claim 12 is set forth including the limitations of claims 10-11 and with additional 
limitations similar to limitations set forth in claim 3. McDonough, Furui, and Epstein 
describe and make obvious the limitations as indicated there. 

38. Claim 18 is rejected using the same rationale as in the previous Office action 
(mailed November 20, 2002 as paper 3), and reproduced here: 

Claim 18 is set forth including the limitations of claim 17 and with additional 
limitations similar to limitations set forth in claim 2. McDonough and Epstein and Furui 
describe and make obvious the limitations as indicated there. 

39. Claim 19 is rejected using the same rationale as in the previous Office action 
(mailed November 20, 2002 as paper 3), and reproduced here: 

Claim 19 is set forth including the limitations of claims 17-18 and with additional 
limitations similar to limitations set forth in claim 3. McDonough, Furui, and Epstein 
describe and make obvious the limitations as indicated there. Epstein also describes 
further limitations as follows: an analog to digital converter [at column 7, lines 1-5, as an 
analog-to-digital converter]. 

40. Claim 24 is rejected using the same rationale as in the previous Office action 
(mailed November 20, 2002 as paper 3), and reproduced here: 

Claim 24 is set forth including the limitations of claim 23 and with additional 
limitations similar to limitations set forth in claim 2. McDonough and Epstein and Furui 
describe and make obvious the limitations as indicated there. 

41. Claim 25 is rejected using the same rationale as in the previous Office action 
(mailed November 20, 2002 as paper 3), and reproduced here: 

Claim 25 is set forth including the limitations of claims 23-24 and with additional 
limitations similar to limitations set forth in claim 12. McDonough, Furui, and Epstein 
describe and make obvious the limitations as indicated there. Epstein also describes 
further limitations as follows: 

an analog to digital converter [at column 7, lines 1-5, as an analog-to-digital 
converter]. 

42. Claim 31 is rejected using the same rationale as in the previous Office action 
(mailed November 20, 2002 as paper 3), and reproduced here: 

Claim 31 is set forth including the limitations of claim 30 and with additional 
limitations similar to limitations set forth in claim 3. McDonough, Furui, and Epstein 
describe and make obvious the limitations as indicated there. 

43. Claims 33 and 34 are set forth including the limitations of claim 30 and with 
additional limitations similar to limitations set forth in claims 6 and 7. Neither 
McDonough nor Furui explicitly describes the additional limitations of claims 6 and 7; 
however, McDonough, Furui, and Epstein describe and make obvious the limitations as 
indicated there. 

44. Claim 38 is rejected using the same rationale as in the previous Office action 
(mailed November 20, 2002 as paper 3), and reproduced here: 

Claim 38 is set forth including the limitations of claim 37 and with additional 
limitations similar to limitations set forth in claim 12. McDonough, Furui, and Epstein 
describe and make obvious the limitations as indicated there. 

45. Claim 42 is set forth including the limitations of claim 41 and with additional 
limitations similar to limitations set forth in claim 21. Neither McDonough nor Furui 
explicitly describes the additional limitations of claim 21; however, McDonough, Furui, 
and Epstein describe and make obvious the limitations as indicated there. 

46. Claim 45 is set forth including the limitations of claim 44 and with additional 
limitations similar to limitations set forth in claim 27. Neither McDonough nor Furui 
explicitly describes the additional limitations of claim 27; however, McDonough, Furui, 
and Epstein describe and make obvious the limitations as indicated there. 

47. Claim 53 is set forth with limitations similar to limitations set forth in claim 30 and 
with additional limitations similar to limitations set forth in claim 51. Neither McDonough 
nor Furui explicitly describes the additional limitations of claim 51; however, 
McDonough, Furui, and Epstein describe and make obvious the limitations as indicated 
there. 

48. Claim 54 is set forth with limitations similar to limitations set forth in claim 37 and 
with additional limitations similar to limitations set forth in claim 51. Neither McDonough 
nor Furui explicitly describes the additional limitations of claim 51; however, 
McDonough, Furui, and Epstein describe and make obvious the limitations as indicated 
there. 

McDonough and Furui 

49. Claims 30, 32, 35-37, 39-41, 43-44, 46, and 49-50 are rejected under 35 U.S.C. 
103(a) as being unpatentable over McDonough et al. [US Patent 5,625,748] and 
Epstein [US Patent 6,327,343] in view of Sadaoki Furui, "Digital Speech Processing, 
Synthesis, and Recognition," Marcel Dekker, Inc., New York, 1989, pp. 225-289, both 
already of record. 

50. Regarding claim 30, McDonough describes the claimed limitations as a whole 
recognizable to one versed in the art as the embodiment for processing untranscribed 
speech comprising: 

storing actions [at column 2, lines 14-17, as create a new node associating an 
action with a word]; 

receive a voice message [at column 1, lines 53-54, as provide an input speech 
message]; 

speech [at column 6, lines 23-29, as untranscribed speech data]; 

predetermined patterns of speech [at column 7, lines 27-37, as HMMs from 
training and modeling]; 

analyze the voice message to determine if it exhibits a predetermined pattern of 
speech [at column 5, lines 43-50, as process a spoken message to produce a signal for 
the potential speech events in the spoken data]; 

perform actions if the predetermined pattern is found in the voice message [at 
column 2, lines 1-8, as route the message according to the action associated with the 
word]. 

Although McDonough [at column 7, lines 27-44] describes spotting the words 
and phrases of the speech data using phonetically trained HMMs for the preferred 
embodiments, McDonough describes using HMMs for this method as known 
techniques. Consequently, McDonough does not describe details of the techniques. In 
particular, McDonough does not explicitly describe HMMs representing either a tone of 
speech or a frequency of speech. 

Furui [at page 255, lines 29-38 & page 258, lines 16-18] describes widely 
investigated word modeling by phonetic HMMs and that feature vectors are applied in 
HMMs. Furui describes: 

the predetermined pattern representing a tone of speech in the voice message 
[at page 8, lines 1-15 and Fig. 8.15, as a lattice taking account of allophones, 
coarticulation, stress, and syllables]; 

the predetermined pattern representing a frequency (or other) of the speech in 
the voice message [at page 278, lines 3-9, as Markov models for recognition of input 
speech converted into spectral feature vectors by DFT]. 

In view of the teachings of Furui about the essential nature of voice containing 
frequency and tone, McDonough's stored voice representations must represent the 
frequency and tone of voice; however, to the extent that McDonough's stored voice 
representations of phonemes, words, and phrases may not innately represent frequency 
(or tone), it would have been obvious to one of ordinary skill in the art of speech 
recognition at the time of invention that Furui's DFT produces frequency spectral 
parameters to represent the HMMs suitable for implementing McDonough's HMMs for 
word and phrase spotting, because McDonough points out HMMs as preferred. 

Although McDonough prefers HMM representations for the voice, McDonough's 
omission of particular details regarding HMMs is due to, and is evidence of, the lack of 
any need for one of ordinary skill in the art of pattern matching to be reminded of such 
details. 

51. Claim 32 is set forth including the limitations of claim 30 and with additional 
limitations similar to limitations set forth in claim 5. McDonough and Furui describe and 
make obvious the limitations as indicated there. 

52. Claim 35 is set forth including the limitations of claim 30 and with additional 
limitations similar to limitations set forth in claim 8. McDonough and Furui describe and 
make obvious the limitations as indicated there. 

53. Claim 36 is set forth including the limitations of claim 30 and with additional 
limitations similar to limitations set forth in claim 9. McDonough and Furui describe and 
make obvious the limitations as indicated there. 

54. Claim 37 and claims 39 and 40 are set forth with limitations similar to claim 30 
and with limitations similar to limitations set forth in claims 14 and 16. McDonough and 
Furui describe and make obvious the limitations as indicated there, where a stored 
voice representation is a predetermined pattern of speech. 

55. Claim 41 and claim 43 are set forth with limitations similar to limitations set forth 
in claim 30 and claim 22. McDonough and Furui describe and make obvious the 
limitations as indicated there. McDonough also describes additional limitations as 
follows: 

a storage device for storing the information associated with the claimed 
functionality [at column 12, line 2, as the internal structure of the event detector, for the 
example at column 2, lines 1-9, the word nodes and action nodes]; 

a processor for accomplishing the claimed functionality [at column 5, lines 45-46, 
as a speech event frequency detector]. 

56. Claim 44 and claim 46 are set forth with limitations similar to limitations set forth 
in claim 37 and claim 22. McDonough and Furui describe and make obvious the 
limitations as indicated there. McDonough also describes additional limitations as 
follows: 

a storage device for storing the information associated with the claimed 
functionality [at column 12, line 2, as the internal structure of the event detector, for the 
example at column 2, lines 1-9, the word nodes and action nodes]; 

a processor for accomplishing the claimed functionality [at column 5, lines 45-46, 
as a speech event frequency detector]. 

57. Claim 49 is set forth with limitations similar to limitations set forth in claims 30 
and 47. McDonough and Furui describe and make obvious the limitations as indicated 
there. 

58. Claim 50 is set forth with limitations similar to limitations set forth in claims 37 
and 48. McDonough and Furui describe and make obvious the limitations as indicated 
there. 

Conclusion 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to DOUGLAS C. GODBOLD whose telephone number is 
(571) 270-1451. The examiner can normally be reached on Monday-Thursday 7:00am-4:30pm 
and Friday 7:00am-3:30pm. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Patrick Edouard can be reached on (571) 272-7603. The fax phone number 
for the organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 